A Parallel Processing System for Polyphonic Singing Synthesis
نویسندگان
چکیده
A synthesiser has been designed for polyphonic (multivoice) singing synthesis. Singing synthesis requires frequent updating of parameters to achieve a natural output sound. These parameters include fundamental frequency, formant frequencies, formant bandwidths, and voice source parameters (such as Open Quotient). The synthesis engine is controlled using a script and a graphical user interface. The hardware consists of several SHARC ADSP-21060 processors, an FPGA and a multi-channel D/A convertor. This paper presents an overview of the synthesiser and the synthesis model employed. The results show a soundfile which has been created using a synthesis-by-analysis approach.
منابع مشابه
Automatic Recognition of Lyrics in Singing
The paper considers the task of recognizing phonemes and words from a singing input by using a phonetic hidden Markov model recognizer. The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music. Due to the lack of annotated singing databases, the recognizer is trained using speech and ...
متن کاملSinging Pitch Extraction from Monaural Polyphonic Songs by Contextual Audio Modeling and Singing Harmonic Enhancement
This paper proposes a novel approach to extract the pitches of singing voices from monaural polyphonic songs. The hidden Markov model (HMM) is adopted to model the transition between adjacent singing pitches in time, and the relationships between melody and its chord, which is implicitly represented by features extracted from the spectrum. Moreover, another set of features which represents the ...
متن کاملAutomatic Transcription of Flamenco Singing Melodic Transcription of Flamenco Singing from Monophonic and Polyphonic Music Recordings
We propose a method for the automatic transcription of flamenco singing from monophonic and polyphonic music recordings. Our transcription system is based on estimating the fundamental frequency (f0) of the singing voice, and follows an iterative strategy for note segmentation and labelling. The generated transcriptions are used in the context of melodic similarity, style classification and pat...
متن کاملModeling of Phoneme Durations for Alignment between Polyphonic Audio and Lyrics
In this work we propose how to modify a standard approach to text-to-speech alignment for solving the problem of alignment of lyrics and singing voice. To this end we model the duration of phonemes, specific to the case of singing. We rely on a duration-explicit hidden Markov model (DHMM) phonetic recognizer based on mel frequency cepstral coefficients (MFCCs), which are extracted in a way robu...
متن کاملImproving Polyphonic Melody Extraction by Dynamic Programming Based Dual F0 Tracking
The suitability of optimal path finding methods for vocal melody extraction in polyphonic music is well recognized since they combine local pitch strength and temporal smoothness considerations in a global sense. However, when such single-F0 tracking systems are applied to sound mixtures in which pitched accompaniment is of comparable strength to the singing voice, they suffer from irrecoverabl...
متن کامل